Words containing ,:

Rank in Wordlist | Frequency | Word |
---|---|---|
3669 | 1 | 0,2 |
3685 | 1 | 1,2 |
3698 | 1 | 12,4-wa |
3712 | 1 | 15,8 |
3750 | 1 | 2,5 |
3751 | 1 | 2,5-wa |
3752 | 1 | 2,9 |
3838 | 1 | 6,7 |
3861 | 1 | 8,1 |
4495 | 1 | France,1988 |

Words containing (:

Rank in Wordlist | Frequency | Word |
---|---|---|
1505 | 4 | eds)(1999 |
3682 | 1 | 1(Mudyaxihi |
3683 | 1 | 1)(b |
3714 | 1 | 16(2)(a |
3782 | 1 | 3(2 |
3783 | 1 | 3(b |
3805 | 1 | 40(1):71-108 |
3818 | 1 | 5(1)(a)(i |
3819 | 1 | 5(1)(a)(ii |
3820 | 1 | 5(1)(b)(i |

Words containing ):

Rank in Wordlist | Frequency | Word |
---|---|---|
1505 | 4 | eds)(1999 |
3683 | 1 | 1)(b |
3714 | 1 | 16(2)(a |
3805 | 1 | 40(1):71-108 |
3818 | 1 | 5(1)(a)(i |
3819 | 1 | 5(1)(a)(ii |
3820 | 1 | 5(1)(b)(i |
3821 | 1 | 5(I)(a)(ii |
5440 | 1 | PTY)LTD |
6768 | 1 | ed)(1977 |

Words containing &:

Rank in Wordlist | Frequency | Word |
---|---|---|
2526 | 2 | Mail&Guardian |
3998 | 1 | B&B |
4970 | 1 | M&G |

Words containing ":

Rank in Wordlist | Frequency | Word |
---|---|---|
9106 | 1 | vaku:"U |

Words containing ':

Rank in Wordlist | Frequency | Word |
---|---|---|
594 | 10 | McGregor's |
705 | 9 | swin'wana |
917 | 7 | n'wi |
1078 | 6 | rin'wana |
1085 | 6 | swin'we |
1289 | 5 | tin'wana |
1294 | 5 | van'wana |
1551 | 4 | man'wana |
1655 | 4 | vun'we |
2065 | 3 | n'hweti |

Words containing +:

Rank in Wordlist | Frequency | Word |
---|---|---|
3684 | 1 | 1+3 |

Words containing /:

Rank in Wordlist | Frequency | Word |
---|---|---|
3758 | 1 | 2010/11 |
3928 | 1 | AidsArt/South |
4091 | 1 | Breath/Tony |
4390 | 1 | East/Rini |
4506 | 1 | GNU/GPL |
4683 | 1 | Ibrahim/McGregor |
5089 | 1 | McGregor/Archie |
5838 | 1 | Studies/Teologiese |
6083 | 1 | Vatsonga/Machangana |
6128 | 1 | Vredenburg/Paternoster |

In the last subsection of this type, we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words and others may not. If words containing disallowed characters occur with more than very low frequency, this may indicate a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
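
The same query can be repeated for every character in the list. The following is a minimal sketch in Python, assuming the `words` table has the columns `w_id`, `freq`, and `word` used in the query above, and that the rank is `w_id - 100`, as that query implies. It runs against an in-memory SQLite database purely for illustration; the original query appears to target MySQL, which is an assumption. SQLite needs an explicit `ESCAPE` clause, so the SQL differs slightly. The helper names `like_pattern` and `words_containing` and the rows `50%` and `a_b` are hypothetical.

```python
import sqlite3

# Characters listed in the text; % and _ are LIKE wildcards and must be escaped.
SPECIAL_CHARS = [",", "(", ")", "%", "&", "$", '"', "'", "+", "*", "=", "/", "_"]


def like_pattern(ch):
    """Return a LIKE pattern matching any word that contains ch literally."""
    escaped = "\\" + ch if ch in ("%", "_", "\\") else ch
    return "%" + escaped + "%"


def words_containing(conn, ch, limit=10):
    """Return (rank, freq, word) rows for words containing ch."""
    query = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(query, (like_pattern(ch), limit)).fetchall()


# Toy data: the first three rows mirror entries from the tables above
# (w_id = rank + 100); the last two are hypothetical, added only to
# exercise the wildcard escaping.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE words (w_id INTEGER PRIMARY KEY, freq INTEGER, word TEXT)")
conn.executemany(
    "INSERT INTO words VALUES (?, ?, ?)",
    [
        (3784, 1, "1+3"),
        (4098, 1, "B&B"),
        (4606, 1, "GNU/GPL"),
        (9001, 1, "50%"),  # hypothetical row
        (9002, 1, "a_b"),  # hypothetical row
    ],
)

# Print the matches for each character that occurs at least once.
for ch in SPECIAL_CHARS:
    rows = words_containing(conn, ch)
    if rows:
        print(repr(ch), rows)
```

Without the escaping step, the patterns for `%` and `_` would match every word of at least one character, so the result for those two characters would say nothing about preprocessing.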